Non Linear Elliptic Theory and
the Monge-Ampere Equation



Luis A. Caffarelli* 

Abstract 

The Monge-Ampere equation plays a central role in the theory of fully
non linear equations. In fact, we would like to show how the Monge-Ampere
equation links, in some way, the ideas coming from the calculus of variations
and those of the theory of fully non linear equations.

When learning complex analysis, it was a remarkable fact that the real part $u$
of an analytic function, just because it satisfies the equation

$u_{xx} + u_{yy} = \Delta u = 0$

(Laplace's equation), is real analytic and, furthermore, the oscillation of $u$ in any
given domain $U$ controls all the derivatives of $u$, of any order, in any subset $\tilde U$
compactly contained in $U$.

One can give three essentially different explanations of this phenomenon.
a) Integral representations (Cauchy integral, for instance). This gives rise to
many of the modern aspects of real and harmonic analysis: fundamental solutions,
singular integrals, pseudo-differential operators, etc. For our discussion, an
important consequence of this theory is the Schauder and Calderon-Zygmund estimates.

Heuristically, they say that if we have a solution of an equation

$A_{ij}(x) D_{ij} u = 0$

and $A_{ij}(x)$ is, in a given functional space, a small perturbation of the Laplacian,
then $D_{ij} u$ is actually in the same functional space as $A_{ij}$. For instance, if $[A_{ij}]$ is
Holder continuous ($C^\alpha(U)$) and positive definite, we can transform it to the identity
(the Laplacian) at any given point $x_0$ by an affine transformation, and it will remain
close to it in a neighborhood. Thus $D_{ij} u$ will also be $C^\alpha(U)$.

b) Energy considerations. Harmonic functions, $u$, are also local minimizers of
the Dirichlet integral

$E(v) = \int (\nabla v)^2 \, dx$.

That is, if we change $u$ to $w$ in $\tilde U \subset\subset U$,

$E(w)|_{\tilde U} \ge E(u)|_{\tilde U}$.

This gives rise to the theory of calculus of variations (minimal surface, harmonic 
maps, elasticity, fluid dynamics). 
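
The minimizing property can be illustrated numerically. The following is a minimal sketch, assuming a one-dimensional domain (where harmonic functions are affine) and a finite-difference version of the Dirichlet integral; the helper `dirichlet_energy`, the grid size and the perturbation size are illustrative choices, not part of the original text.

```python
import random

def dirichlet_energy(values, h):
    """Discrete Dirichlet integral: sum over cells of ((v[i+1]-v[i])/h)^2 * h."""
    return sum(((values[i + 1] - values[i]) / h) ** 2 * h
               for i in range(len(values) - 1))

n = 50
h = 1.0 / n
# In one dimension the harmonic functions are exactly the affine ones.
u = [2.0 + 3.0 * i * h for i in range(n + 1)]
energy_u = dirichlet_energy(u, h)

random.seed(0)
perturbed_energies = []
for _ in range(100):
    # Change u only at interior nodes, so that w = u at the boundary.
    w = list(u)
    for i in range(1, n):
        w[i] += random.uniform(-0.1, 0.1)
    perturbed_energies.append(dirichlet_energy(w, h))

# Every competitor w has at least the energy of u.
print(energy_u, min(perturbed_energies))
```

In this discrete setting the cross term between $u$ and the perturbation telescopes to zero, which is the finite-difference analogue of integrating by parts against a harmonic function.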

One is mainly concerned, there, with equations (or systems) of the form

$D_i F_i(\nabla u, X) = 0$.    (1)

For instance, in the case in which $u$ is a local minimizer of

$E(u) = \int \mathcal{F}(\nabla u, X) \, dx$,

(1) is simply the Euler-Lagrange equation associated to $E$:

$F_i = \nabla_p \mathcal{F}$.

If we attempt to write (1) in second derivatives form, we get

$F_{i,j}(\nabla u, X) D_{ij} u + \cdots = 0$.

This strongly suggests that in order for the variational problem to be "elliptic",
like the Laplacian, $F_{i,j}$ should be positive definite, that is, $\mathcal{F}$ should be strictly
convex.

It also leads to the natural strategy of showing that $\nabla u$, which in principle is
only in $L^2$ (finite energy), is in fact Holder continuous. Reaching this regularity
allows us to apply the (linear) Schauder theory.

That implies $D_{ij} u$ is $C^\alpha(U)$, thus $\nabla u$ is $C^{1,\alpha}(U)$, and so on (the bootstrapping
method).

The difficulty with this approach is that solutions, $u$, are invariant under
dilations of their graphs.

This fact keeps the class of Lipschitz functions (bounded gradients) invariant.
There is no reason, thus, to expect that this equation will "improve" under
dilations. The fact that $\nabla u$ is indeed Holder continuous is the celebrated theorem
of De Giorgi, which solved Hilbert's nineteenth problem.

De Giorgi looked at the equation that first derivatives, $u_\alpha$, satisfy:

$D_i F_{i,j}(\nabla u) D_j u_\alpha = 0$.

He thought of $F_{i,j}(\nabla u)$ as elliptic coefficients $A_{ij}(x)$ that had no regularity
whatsoever, and he proved that any solution $w$ of

$D_i A_{ij}(x) D_j w = 0$

was Holder continuous:

$\|w\|_{C^\alpha(\tilde U)} \le C \|w\|_{L^2(U)}$.

De Giorgi's theorem is in fact a linear one, but for a new invariant class of
equations. No matter how the solution (and the equation) is renormalized, it stays
far from the constant coefficient theory, and a radically new idea surfaces: if we have
a class of functions for which at every scale, in some average sense, the function
controls its derivatives (the energy inequality), further regularity follows.

Finally, the third approach is
c) Comparison principle. Two solutions $u_1$, $u_2$ of $\Delta u = 0$ cannot "touch without
crossing". That is, if $u_1 - u_2$ is positive, it cannot become zero at some interior point,
$X_0$, of $U$.

Again, heuristically, this is because the function

$F(D^2 u) = \Delta u = \mathrm{Trace}[D^2 u]$

is a monotone function of the Hessian matrix $[D_{ij} u]$ and, thus, in some sense, we
must have $F(D^2 u_1)$ ">" $F(D^2 u_2)$ at $X_0$ (or nearby).

The natural family of equations to consider in this context is then

$F(D^2 u) = 0$

for $F$ a strictly monotone function of $D^2 u$.

Equations of this type appear in differential geometry. For instance, the
coefficients of the characteristic polynomial of the Hessian

$P(\lambda) = \det(D^2 u - \lambda I)$

are such equations if we restrict $D^2 u$ to stay in the appropriate set of $\mathbb{R}^{n \times n}$. If $\lambda_i$
denote the eigenvalues of $D^2 u$,

$C_1 = \Delta u = \sum \lambda_i$ (Laplace)
$C_2 = \sum_{i<j} \lambda_i \lambda_j$
...
$C_n = \prod \lambda_i = \det D^2 u$ (Monge-Ampere).

In the case of $C_n$, $\det D^2 u = \prod \lambda_i$ is a monotone function of the Hessian provided
that all the $\lambda_i$'s are positive, that is, provided that the function $u$ under consideration
is convex.
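
The role of convexity in the monotonicity of the determinant can be seen on a small example. The sketch below is only an illustration with hand-picked 2x2 matrices (the names `det2`, `add`, `P` and the two sample Hessians are assumptions made for this example): adding a nonnegative definite matrix raises the determinant of a positive definite Hessian, but lowers it for a saddle-type Hessian.

```python
def det2(m):
    """Determinant of a 2x2 matrix given as ((a, b), (c, d))."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def add(m, p):
    """Entrywise sum of two 2x2 matrices."""
    return tuple(tuple(m[i][j] + p[i][j] for j in range(2)) for i in range(2))

# A nonnegative definite increment of the Hessian.
P = ((1.0, 0.0), (0.0, 0.0))

convex_hessian = ((1.0, 0.0), (0.0, 1.0))    # eigenvalues 1, 1 (convex case)
saddle_hessian = ((1.0, 0.0), (0.0, -1.0))   # eigenvalues 1, -1 (not convex)

det_convex_before = det2(convex_hessian)
det_convex_after = det2(add(convex_hessian, P))
det_saddle_before = det2(saddle_hessian)
det_saddle_after = det2(add(saddle_hessian, P))

print(det_convex_before, det_convex_after)   # increases in the convex case
print(det_saddle_before, det_saddle_after)   # decreases in the saddle case
```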

If $F(D^2 u, X)$ is uniformly elliptic, that is, if $F$ is strictly monotone as a
function of the Hessian, or in differential form,

$F_{ij}(M) = D_{m_{ij}} F$

is uniformly positive definite, then solutions of $F(D^2 u) = 0$ are $C^{1,\alpha}(U)$. As in the
divergence case, this is because first derivatives $u_\alpha$ satisfy an elliptic equation,

$F_{ij}(D^2 u) D_{ij} u_\alpha = 0$,

now in non divergence form. As long as we do not have further information on $D^2 u$,
we must think again of $F_{ij}$ as bounded measurable coefficients.

The De Giorgi type theorem for $a_{ij}(x) D_{ij} u_\alpha = 0$ is due to Krylov and Safonov,
and states again that solutions of such an equation are Holder continuous.

We point out that, again, this result has "jumped" invariance classes. Rescaling
of $a_{ij}(x)$ does not improve them. Unfortunately, this is not enough to "bootstrap",
as in the divergence case: the coefficients, $A_{ij}(x) = F_{ij}(D^2 u)$, depend on second
derivatives. If we manage to prove that $D^2 u$ is Holder continuous, then, from
the linearized equation for $u_\alpha$, $\nabla u$ would be $C^{2,\alpha}(U)$, i.e., $u$ would be $C^{3,\alpha}(U)$, and we
could improve and improve.

To prove this, once more convexity reappears. If $F(D^2 u)$ is concave (or
convex), then all pure second derivatives are sub (or super) solutions of the linearized
operator. This, together with the fact that $D^2 u$ lies on the surface $F(D^2 u) = 0$, implies
the Holder continuity of $D^2 u$ and, by the bootstrapping argument, $u$ is as smooth
as $F$ allows.

The Monge-Ampere equation and optimal transportation 

We would now like to turn our attention to the Monge-Ampere equation

$\det D^2 u = \prod \lambda_i = f(x, u, \nabla u)$.

As pointed out before, the equation fits in the context of elliptic equations provided
that we consider convex solutions, that is, provided that $f$ is positive. Further,
$\log \det D^2 u = \sum \log \lambda_i$ is concave as a function of the $\lambda_i$, and thus is a concave
function of $D^2 u$. Unfortunately, $\det D^2 u$ is not uniformly strictly convex.
For instance, if we prescribe

$\det D^2 u = \prod \lambda_i = 1$,

ellipticity deteriorates as one of the $\lambda$'s goes to infinity and some other is forced to
go to zero. This difficulty is compensated by two fundamental facts.
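
The loss of ellipticity can be made explicit by linearizing the determinant. The following short computation is a sketch, assuming the Hessian has been diagonalized at the point under consideration; $\operatorname{cof}$ denotes the cofactor matrix, a notation not used in the original text.

```latex
% Linearization of F(M) = det M: the coefficients are the cofactors.
F_{ij}(M) = \frac{\partial \det M}{\partial m_{ij}} = \operatorname{cof}(M)_{ij}.

% For M = diag(\lambda_1, ..., \lambda_n) with \prod_k \lambda_k = 1:
F_{ii}(M) = \prod_{k \neq i} \lambda_k = \frac{1}{\lambda_i},
\qquad F_{ij}(M) = 0 \quad (i \neq j).

% Ratio between the largest and the smallest coefficient:
\frac{\max_i F_{ii}}{\min_i F_{ii}} = \frac{\lambda_{\max}}{\lambda_{\min}}.
```

The ratio is unbounded as one eigenvalue grows and another shrinks, so the linearized operator has no uniform ellipticity constants.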

1) The rich family of invariances that the Monge-Ampere equation enjoys. 

2) Its "hidden" divergence structure. 

The divergence structure is due to the fact that $\det D^2 u$ can be thought of as
the Jacobian of the gradient map $X \to \nabla u$. Thus for any domain $\tilde U$,

$\int_{\tilde U} \det D^2 u \, dx = \mathrm{Vol}(\nabla u(\tilde U))$.

But if $\tilde U \subset\subset U$, $u$ being convex implies that

$|\nabla u| \big|_{\tilde U} \le C \operatorname{osc}_U u$.

This gives us a sort of "energy inequality" that controls a positive quantity of $D^2 u$
by the oscillation of $u$:

$\int_{\tilde U} \det D^2 u \le C(\tilde U, U) (\operatorname{osc} u)^n$.

The Monge-Ampere equation is invariant, of course, under the standard
families of transformations:

a) Rigid motions, $R$:

$\det D^2 [u(Rx)] = f(Rx)$,

b) Translations:

$\det D^2 [u(x + v)] = f(x + v)$,

c) Quadratic dilations:

$\det D^2 \left[ \frac{1}{t^2} u(tx) \right] = f(tx)$.

But also

d) Monge-Ampere is invariant under any affine transformation $A$ of determinant
one:

$\det D^2 [u(Ax)] = f(Ax)$.
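
All four invariances follow from the chain rule for the Hessian. The short derivation below is a sketch in which $v$ is an auxiliary name introduced here for the transformed function, and $u$ is assumed to solve $\det D^2 u = f$.

```latex
% Composition with a linear map A:
v(x) = u(Ax) \;\Longrightarrow\; D^2 v(x) = A^{T} \, D^2 u(Ax) \, A,

\det D^2 v(x) = (\det A)^2 \det D^2 u(Ax) = (\det A)^2 f(Ax).

% If |det A| = 1 (rigid motions, or affine maps of determinant one):
\det D^2 v(x) = f(Ax).

% Quadratic dilations: v(x) = t^{-2} u(tx) gives D^2 v(x) = D^2 u(tx), hence
\det D^2 v(x) = f(tx).
```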

If $f$ is, for instance, in one of the following classes:

a) $f$ constant,

b) $f$ close to constant ($|f - 1| < \varepsilon$),

c) $f$ bounded away from zero and infinity ($0 < \lambda \le f \le \Lambda$),

any of the transformations above gives a new $u$ in the same class of solutions.
For instance, if $u$ is a solution of

$\det D^2 u = 1$,

then $u(\varepsilon x, \frac{1}{\varepsilon} y)$ is also a solution of the same equation. But this has dramatically
"deformed" the graph of $u$. It is then almost unavoidable that there are singular
solutions (Pogorelov).

In fact, for $n \ge 3$, one can construct convex solutions $u$ that contain a line in their
graph and are not differentiable in the direction transversal to that line, solutions
of

$\det D^2 u = f(x)$

with $f$ a smooth positive function.

Fortunately, this geometry can only be inherited from the boundary of the 
domain. 

Theorem 0.1. If in the domain $U \subset \mathbb{R}^n$

a) $\lambda \le \det D^2 u \le \Lambda$,

b) $u \ge 0$,

c) the set $\Gamma = \{u = 0\}$ is not a point, then $\Gamma$ is generated as "convex
combinations" of its boundary points:

$\Gamma$ = convex envelope of $\Gamma \cap \partial U$.

A corollary of this theorem is that

a) If we can "cut a slice" of the graph of $u$ with a hyperplane $l(x)$ so that the
support $S$ of $(u - l)^-$ is compactly contained in $U$, then $u$ is, inside $S$, both
$C^{1,\alpha}$ regular and also $C^{1,\alpha}$-strictly convex, i.e., it separates from any of its
supporting planes with polynomial growth.

This is the equivalent of De Giorgi's and Krylov-Safonov's results (remember that
the theorems were applied to the derivatives of the solutions of the non-linear
equations under consideration).

Note that by an affine transformation and a dilation we can always renormalize
the support of the "slice" $S$ to be equivalent to the unit ball of $\mathbb{R}^n$: $B_1 \subset S \subset B_n$.

After this normalization, it is possible to reproduce for $u$ all the classical
estimates we had for the Laplacian:

a) (Calderon-Zygmund). If $f$ is close to constant ($|f - 1| < \varepsilon$), then $D^2 u \in
L^p(B_{1/2})$ ($p = p(\varepsilon)$ goes to infinity when $\varepsilon$ goes to zero).

b) If $f \in C^{k,\alpha}$ (has up to $k$ derivatives Holder continuous), then $u \in C^{k+2,\alpha}$ (all
second derivatives of $u$ are $C^{k,\alpha}$).

Note that $f$ plays, for Monge-Ampere, simultaneously the role of "right hand
side" and "coefficients" due to the structure of its non-linearity.

The Monge-Ampere equation and optimal transportation (the
Monge problem)

The Monge-Ampere equation has many applications, not only in geometry, but
also in applied areas: optimal design of antenna arrays, vision, statistical mechanics,
front formation in meteorology, financial mathematics.

Many of these applications are related to optimal transportation and the
Wasserstein metric between probability distributions. In the discrete case,
optimal transportation consists of the following.

We are given two sets of $k$ points in $\mathbb{R}^n$: $X_1, \ldots, X_k$ and $Y_1, \ldots, Y_k$, and want
to map the $X$'s onto the $Y$'s, i.e., we look at all one-to-one functions $Y(X_j)$. But
we want to do so minimizing some transportation cost

$\mathcal{C} = \sum_j C(Y(X_j) - X_j)$.

For our discussion, $C(X - Y) = \frac{1}{2} |X - Y|^2$. It is easy to see that the minimizing
map must be the gradient (subdifferential) of a convex potential $\varphi$.
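
For very small $k$ the discrete problem can be solved by exhaustive search. The sketch below is only an illustration (brute force over all permutations, with made-up sample points; the function names are not from the original text). It also checks the pairwise monotonicity $\langle X_i - X_j, Y(X_i) - Y(X_j) \rangle \ge 0$, which is a necessary condition for a map to be the gradient of a convex potential, though not a full test of cyclical monotonicity.

```python
from itertools import permutations

def cost(x, y):
    """Quadratic cost C(X - Y) = |X - Y|^2 / 2 for points given as tuples."""
    return 0.5 * sum((a - b) ** 2 for a, b in zip(x, y))

def total_cost(xs, ys, sigma):
    """Total cost of the one-to-one map X_j -> Y_sigma(j)."""
    return sum(cost(xs[j], ys[sigma[j]]) for j in range(len(xs)))

def optimal_assignment(xs, ys):
    """Exhaustive search over all one-to-one maps (only feasible for small k)."""
    return min(permutations(range(len(ys))),
               key=lambda s: total_cost(xs, ys, s))

def is_pairwise_monotone(xs, ys, sigma, tol=1e-12):
    """Check <X_i - X_j, Y(X_i) - Y(X_j)> >= 0 for every pair i, j."""
    for i in range(len(xs)):
        for j in range(i + 1, len(xs)):
            dx = [a - b for a, b in zip(xs[i], xs[j])]
            dy = [a - b for a, b in zip(ys[sigma[i]], ys[sigma[j]])]
            if sum(p * q for p, q in zip(dx, dy)) < -tol:
                return False
    return True

# Hypothetical sample data: four points in the plane.
xs = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (2.0, 2.0)]
ys = [(3.0, 3.0), (0.5, 0.0), (1.0, 0.2), (0.0, 1.5)]
sigma = optimal_assignment(xs, ys)
print(sigma, total_cost(xs, ys, sigma), is_pairwise_monotone(xs, ys, sigma))
```

In one dimension the same search returns the order-preserving matching, which is the gradient of a convex function of one variable.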

In the continuous case, instead of having $k$ points we have two probability
densities, $f(X) \, dX$ and $g(Y) \, dY$, and we want to consider those (admissible) maps
$Y(X)$ that "push forward" $f$ to $g$.

Heuristically, that means that in the change of variable formula we can
substitute

$g(Y(X)) \det D_X Y(X) = f(X)$.

A weak formulation substitutes the map $Y(X)$ by a joint probability density
$\nu(X, Y)$ with marginals $f(X) \, dX$ and $g(Y) \, dY$, i.e.,

$f(X_0) = \int d_Y \nu(X_0, Y)$,
$g(Y_0) = \int d_X \nu(X, Y_0)$.

(We don't ask the "map" to be one-to-one any more; the image of $X_0$ may now
spread among "many $Y$'s".)

Among all such $\nu$, we want to maximize correlation

$\mathcal{K} = \int \langle X, Y \rangle \, d\nu(X, Y)$

or minimize cost

$\mathcal{C} = \int \frac{1}{2} |X - Y|^2 \, d\nu(X, Y)$.

$\sqrt{\mathcal{C}}$ defines a metric, the Wasserstein metric among probability densities.
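
That maximizing correlation and minimizing cost are the same problem can be checked by expanding the square. The computation below is a sketch, assuming that $f$ and $g$ have finite second moments and writing $\mathcal{K}(\nu)$, $\mathcal{C}(\nu)$ for the correlation and cost of a joint density $\nu$ with the prescribed marginals.

```latex
\frac{1}{2} |X - Y|^2 = \frac{1}{2} |X|^2 + \frac{1}{2} |Y|^2 - \langle X, Y \rangle,

% Integrating against \nu and using the marginal conditions:
\mathcal{C}(\nu) = \frac{1}{2} \int |X|^2 f(X) \, dX
                 + \frac{1}{2} \int |Y|^2 g(Y) \, dY
                 - \mathcal{K}(\nu).
```

The first two terms depend only on $f$ and $g$, not on $\nu$, so the minimizers of $\mathcal{C}$ are exactly the maximizers of $\mathcal{K}$.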
Under mild hypotheses, we have the

Theorem 0.2. The unique optimal $\nu_0$ concentrates on a graph (is actually a
one-to-one map, $Y(X)$). Further, $Y(X)$ is the subdifferential of a convex potential $\varphi$,
i.e., $Y(X) = \nabla \varphi$. Heuristically, then, $\varphi$ must satisfy the Monge-Ampere equation

$g(\nabla \varphi) \det D^2 \varphi = f(X)$.

For several reasons, the weak theory does not apply in general, but one can
still prove, for instance:

Theorem 0.3. If $f$ and $g$ never vanish, or if the supports of $f$ and $g$ are convex
sets, the map $Y(X)$ is "one derivative better" than $f$ and $g$.



Some applications and current issues 

a) It was pointed out by Otto that the Wasserstein metric can be used to
describe the evolution of several of the classical "diffusion" equations: heat equation,
porous media, lubrication.

The idea is that a diffusion process for one equation with conservation of
mass consists of the balance of two factors: trying to minimize distance between
consecutive distributions ($u(x, t_k)$ and $u(x, t_{k+1})$), plus trying to flatten or smooth
(diffuse) $u(x, t_{k+1})$.
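
For the heat equation, this balance is usually written as the time-discrete scheme of Jordan, Kinderlehrer and Otto. The formula below is a sketch of that scheme and is not taken from the original text: the time step $\tau = t_{k+1} - t_k$ and the notation $\mathcal{C}(u^k, u)$ for the optimal quadratic transportation cost between $u^k = u(\cdot, t_k)$ and a competitor $u$ are assumptions made here.

```latex
u^{k+1} \in \operatorname*{arg\,min}_{u \ge 0, \ \int u = \int u^k}
\left\{ \frac{1}{\tau} \, \mathcal{C}(u^k, u) + \int u \log u \, dx \right\},
\qquad
\mathcal{C}(u^k, u) = \inf_{\nu} \int \frac{1}{2} |X - Y|^2 \, d\nu(X, Y),
```

where the infimum is over joint densities $\nu$ with marginals $u^k$ and $u$. The first term penalizes the distance between consecutive distributions, and the entropy term favors flatter densities.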

This fact has made it possible to prove rates of decay to equilibrium in many of the
classical equations, as well as a number of new phenomena. The fine relations
between the discrete and continuous problems are an evolving issue (rate of convergence,
regularity of the discrete problems, etc.).

b) Another family of problems, coming both from geometry and optimal
transportation, concerns the study of several issues on solutions of Monge-Ampere
equations in periodic or random media.

b1) Liouville type theorems: We start with a theorem of Calabi of Liouville
type: Given a global convex solution of the Monge-Ampere equation $\det D^2 u = 1$, $u$
must be a quadratic polynomial. Suppose now that instead of RHS equal to one,
we have a general RHS, $f(x)$. Given a global solution, to discover its behavior at
infinity we may try to "shrink it" through quadratic transformations:

$u_\varepsilon = \varepsilon^2 u(x / \varepsilon)$ satisfies $\det D^2 u_\varepsilon = f(x / \varepsilon)$.

Suppose now that $f$ averages out at infinity, for instance $f$ is periodic. Then, due
to the "divergence structure" of Monge-Ampere, $u_\varepsilon$ should converge to a quadratic
polynomial.
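
The effect of the quadratic rescaling can be verified directly. This is a short worked computation, reading the rescaled function as $u_\varepsilon(x) = \varepsilon^2 u(x / \varepsilon)$ and assuming $\det D^2 u = f$.

```latex
u_\varepsilon(x) = \varepsilon^2 \, u\!\left( \frac{x}{\varepsilon} \right)
\;\Longrightarrow\;
\nabla u_\varepsilon(x) = \varepsilon \, \nabla u\!\left( \frac{x}{\varepsilon} \right),
\qquad
D^2 u_\varepsilon(x) = D^2 u\!\left( \frac{x}{\varepsilon} \right),

\det D^2 u_\varepsilon(x) = f\!\left( \frac{x}{\varepsilon} \right).
```

The Hessian keeps its size while the right hand side oscillates faster and faster, which is the setting of periodic homogenization.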

Theorem 0.4. Given a RHS $f(x)$, periodic, with average $\fint f = a$:

i) Given any quadratic polynomial $P$ with $\det D^2 P = a$, there exists a unique
periodic function $w$ such that

$\det D^2 (P + w) = f(x)$

($w$ is a "corrector" in homogenization language).
ii) Conversely (Liouville type theorem): Given a global solution $u$, it must be of
the form $P + w$.

What are the implications for homogenization? What can we say if $f(X, u, \nabla u)$
is periodic in $X$ and $u$? What can we say if $f_\omega(x)$ is random in $X$?

b2) Vorticity transport: (2 dimensions) Again in the periodic context, we
seek a "vorticity density", $\rho(X, t)$, periodic in $X$. At each time $t$, $\rho$ generates a
periodic "stream function", $\psi(X, t)$, by the equation

$\det(I + D^2 \psi) = \rho$.

In turn, $\psi$ generates a periodic velocity field $v = (-\psi_y, \psi_x)$ that transports $\rho$:

$\rho_t + \operatorname{div}(v \rho) = 0$.

Given some initial data $\rho_0(x)$, what can we say about $\rho$?

If $\rho_0$ is a vorticity patch, $\rho_0(x) = 1 + \chi_\Omega$, does it stay that way?

If we choose $\rho_0$, $\psi_0$ so that $\rho_0 = F(\psi_0)$, that is, $\det(I + D^2 \psi_0) = F(\psi_0)$, we have
a stationary vorticity array, i.e., $\rho(X, t) = \rho_0$.

What can we say, in parallel to the classic theory of rotating fluids, or plasma,
where $\det(I + D^2 \psi)$ is substituted by $\Delta \psi$?

c) Another area of research relates to optimal transportation as a natural
"map" between probability densities. It has been shown that optimal transportation
explains naturally interpolation properties of densities (of Brunn-Minkowski type),
monotonicity properties (like correlation inequalities that express in which way the
probability density, $g$, is shifted in some cone of directions with respect to $f$), and
concentration properties of $g$ versus $f$ (in which sense, for instance, a log concave
perturbation of a Gaussian is more concentrated than a Gaussian).

Of particular interest would be to understand optimal transportation as
dimension goes to infinity. Since convex potentials are very stable objects, this would
provide, under some circumstances, an "infinite dimensional" change of variables
formula between probability densities.

d) Finally, one of my favorite problems is to understand the geometry of
optimal transportation in the case in which the cost function $C(X - Y)$ is still
strictly convex, but not quadratic. In that case, the optimal map is still related to
a potential $\psi$ that satisfies

$\det(I + D(F_j(\nabla \psi))) = \cdots$

where $F_j$ is now the gradient of the convex conjugate to $C$.

At this point, we have come full circle and we are now in a higher hierarchy,
in a sort of Lagrangian version of the Euler-Lagrange equation from the calculus of
variations.

In fact, if we put an epsilon in front of $D$ and linearize,

$\det(I + \varepsilon D(F_j(\nabla \psi))) = 1 + \varepsilon \, \mathrm{Trace}(D(F_j(\nabla \psi))) + O(\varepsilon^2) = 1 + \varepsilon \operatorname{div} F_j(\nabla \psi) + O(\varepsilon^2)$.
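
The first-order expansion of the determinant used here can be checked numerically. The sketch below is an illustration on an arbitrary 3x3 matrix `M` standing in for $D(F_j(\nabla \psi))$ at a point; the matrix entries and the helper names are made up for this example.

```python
def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def trace(m):
    """Sum of the diagonal entries."""
    return sum(m[i][i] for i in range(3))

def remainder(m, eps):
    """det(I + eps*m) minus its first-order expansion 1 + eps*trace(m)."""
    perturbed = [[(1.0 if i == j else 0.0) + eps * m[i][j] for j in range(3)]
                 for i in range(3)]
    return det3(perturbed) - (1.0 + eps * trace(m))

# Hypothetical matrix playing the role of D(F_j(grad psi)) at one point.
M = [[0.3, -1.2, 0.5],
     [0.7, 0.1, -0.4],
     [-0.6, 0.9, 0.2]]

# Dividing eps by 10 should divide the remainder by roughly 100.
ratio = remainder(M, 1e-2) / remainder(M, 1e-3)
print(remainder(M, 1e-2), remainder(M, 1e-3), ratio)
```

A ratio close to 100 is consistent with a remainder of order $\varepsilon^2$.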

L. Ambrosio and C. Villani for optimal transportation.



